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^ (54) Title: PROTEIN LABELLING WITH 06-ALKYLGUANINE-DNA ALKYLTRANSFERASE 

o 

® (57) Abstract: The present invention relates to methods of transferring a label from suitable substrates to 06-alkylguanine-DNA 
alkyltransferase (AGT) fusion proteins, to suitable fusion proteins, to suitable variants of AGT, and to novel labelled fusion proteins 
^ obtained. A protein of interest is incorporated into an AGT fusion protein, the AGT fusion protein is contacted with an AGT substrate 
^ carrying a label, and the AGT fusion protein is detected and/or manipulated using the label in a system designed for recognising 
^ and/or handling the label. 
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Protein labelling with 0 6 -Alkylguanine-DNA Alkyltransferase 

Field of the Invention 

5 The present invention relates to methods of transferring a label from suitable substrates to 
0 8 -alkylguanine-DNA alkyltransferase fusion proteins, and to novel labelled fusion proteins 
obtained. 

Background of the invention 

10 

The mutagenic and carcinogenic effects of electrophiles such as N-methyl-N-nitrosourea are 
mainly due to the O e -alkylation of guanine in DNA. To protect themselves against DNA- 
alkylation, mammals and bacteria possess a protein, O e -alkylguanine-DNA alkyltransferase 
(AGT) which repairs these lesions. AGT transfers the alkyl group from the position 0-6 of 

1 5 alkylated guanine and guanine derivatives to the mercapto group of one of its own cysteines, 
resulting in an irreversibly alkylated AGT. The underlying mechanism is a nucleophilic 
reaction of the S N 2 type which explains why not only methyl groups, but also benzylic groups 
are easily transferred. As overexpression of AGT in tumour cells is the main reason for 
resistance to alkylating drugs such as procarbazine, dacarbazine, temozolomide and bis-2- 

20 chloroethyl-N-nitrosourea, inhibitors of AGT have been proposed for use as sensitisers in 
chemotherapy (Pegg et a/., Prog Nucleic Acid Res Mol Biol 51: 167-223, 1995). 

DE 199 03 895 discloses an assay for measuring levels of AGT which relies on the reaction 
between biotinylated O e -alkylguanine derivatives and AGT which leads to biotinylation of the 
25 AGT. This in turn allows the separation of the AGT on a streptavidin coated plate and its 
detection, e.g. in an ELISA assay. The assay is suggested for monitoring the level of AGT in 
tumour tissue and for use in screening for AGT inhibitors. 

Damoiseaux et a/., ChemBiochem. 4: 285-287, 2001, disclose modified 0 6 -alkylated guanine 
30 derivatives incorporated into oligodeoxyribonucleotides for use as chemical probes for 
labelling AGT, again to facilitate detecting the levels of this enzyme in cancer cells to aid in 
research and in chemotherapy, 

PCT/GB02/01636 discloses a method for detecting and/or manipulating a protein of interest 
35 wherein the protein is fused to AGT and the AGT fusion protein contacted with an AGT 
substrate carrying a label, and the AGT fusion protein detected and optionally further 
manipulated using the label. Several AGT fusion proteins to be used, general structural 
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principles of the AGT substrate and a broad variety of labels and methods to detect the label 
useful in the method are described. 

Summary of the invention 

5 

The invention relates to a method for detecting and/or manipulating a protein of interest, 
wherein the protein of interest is incorporated into an AGT fusion protein, the AGT fusion 
protein is contacted with a suitable AGT substrate carrying a label, and the AGT fusion 
protein is detected or manipulated or both manipulated and detected in any order using the 
10 label in a system designed for recognising and/or handling the label. 

The protein of interest according to the invention is selected from the group consisting of 
enzymes, DNA-binding proteins, transcription regulating proteins, membrane proteins, 
nuclear receptor proteins, nuclear localization signal proteins, protein cofactors, small 
1 5 monomeric GTPases, ATP-binding cassette proteins, intracellular structural proteins, 
proteins with sequences responsible for targeting proteins to particular cellular 
compartments, proteins generally used as labels or affinity tags, and domains or subdomains 
of the aforementioned proteins, excluding the major head protein D of phage X (gpD) and 
those particular proteins of interest disclosed in PCT/GB02/01636 (WO 02/083937). 

20 

The AGT fusion protein may consist of one or more, e.g. one, two or three, proteins of 
interest fused to AGT at the N-, C- or N- and C-terminal of AGT. AGT may be human AGT 
(hAGT), other mammalian AGT, or a variant of a wild-type AGT with one or more amino acid 
substitution, deletion or addition. 

25 

The invention relates also to the novel AGT fusion proteins as such, and in particular to 
labelled AGT fusion proteins obtained in the method of the invention comprising an AGT 
fusion protein covalently bound to a substrate carrying a label. 

30 Detailed description of the invention 

In the present invention a protein or peptide of interest is fused to an O e -alkylguanine-DNA 
alkyltransferase (AGT). The protein or peptide of interest may be of any length and both with 
and without secondary, tertiary or quaternary structure, and preferably consists of at least 
35 twelve amino acids and up to 2000 amino acids, preferably between 50 and 1 000 amino 
acids. 
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The protein of interest according to the invention is selected from the group consisting of 



enzymes, e.g. 

transferases (EC 2), more specific a transferase transferring an alkyl or aryl group other than 
5 a methyl group (EC 2.5), in particular a glutathione transferase (EC 2.5.1 .18), 

or a kinase, that is a transferase transferring phosphorus containing groups (EC 2.7), in 
particular a kinase with an alcohol group as acceptor (EC 2.7.1), such as a protein kinase 
with serine and threonine as the phosphorylated target sites in the substrate protein, e.g. 
casein kinase from yeast (EC 27.1 .37), or a tyrosine protein kinase (EC 2.7.1.1 12); 

1 0 or e.g. oxidoreductases (EC 1 ), more specific an oxidoreductase acting on peroxide as 
acceptor (EC 1 .1 1 ), in particular the enzyme cytochrome C peroxidase (EC 1.11.1.5); 
or e.g. hydrolases (EC 3), more specific a hydrolase acting on an ester bond (EC 3.1), in 
particular a phosphoric monoester hydrolase (EC 3.1 .3), such as a protein phosphoric 
monoester hydrolase; or a hydrolase hydrolyzing peptide bonds, also known as peptidase or 

15 protease (EC 3.4), in particular a caspase; 

DNA-binding proteins, more specific transcription repressor proteins which are protein factors 
inhibiting mRNA synthesis, specifically a protein factor inhibiting mRNA synthesis in £ coli, 
in particular the DNA-binding domain of the LexA protein; 

20 

transcription regulating proteins, more specific transcription repressor proteins, in particular 
transcription repressor proteins containing a tryptophan/aspartate repeat structure, 
specifically the S. cerevisiae transcription repressor Tup1 ; 

25 membrane proteins, e.g. membrane proteins showing at least one transmembrane helix, 
more specific membrane proteins from the endoplasmatic reticulum (ER) membrane, in 
particular membrane proteins being active in protein translocation into the ER, such as the 
ER transmembrane protein Sec62; 

or e.g. a protein from the family of 7-transmembrane helix (7-TM) proteins, more specific a 
30 7-TM protein being a G-protein coupled receptor (GPCR), in particular those that bind 
macromolecular ligands with a molecular weight above 1 kDa, such as a mammalian, e.g. 
human, neurokinin-1 -receptor (NK1 ); 

or e.g. transmembrane ion channel proteins from the cell membrane, in particular ligand 
gated ion channel proteins, more specific a ligand gated ion channel protein sensitive to 
35 serotonin, such as the serotonin receptor 5-HT3; 

or e.g. membrane receptors other than ion channels and G-protein coupled receptors; 

or e.g. peroxisomal membrane proteins, in particular from yeast, such as the protein Pex15; 
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nuclear receptor proteins, e.g. nuclear receptor proteins from the family of transcription 
factors, more specific nuclear receptor proteins from the family of ligand inducible 
transcription factors, in particular a nuclear receptor from the family of steroid, e.g. estrogen, 
5 receptors, such as the human estrogen receptor hER; 

nuclear localization signal proteins, such as the nuclear localization signal from the Simian 
Virus 40 (SV40); 

10 protein cofactors, e.g. proteins containing an ubiquitin sequence in their genetic structure; 

small monomeric GTPases, more specific membrane-adherent small monomelic GTPases, 
e.g. a member of the Ras family, 

15 ATP-binding cassette (ABC) proteins, e.g. a multiple drug resistance protein; 

intracellular structural proteins, more specifically proteins of the cytoskeleton, more 
specifically human cytoplasmic p-actin; 

20 proteins with sequences responsible for targeting proteins to particular cellular 
compartments, e.g. to the Golgi apparatus, the endoplasmatic reticulum (ER), the 
mitochondria, the plasma membrane or the peroxisome; 

proteins generally used as labels or affinity tags, e.g. fluorescent proteins giving a fluorescent 
25 signal on excitation with UV or visible radiation, in particular fluorescent proteins from the 
family known as green fluorescent proteins (GFP), such as the fluorescent protein known as 
enhanced cyano fluorescent protein (ECFP); 

and domains or subdomains of the aforementioned proteins. 

30 

Furthermore, the protein of interest according to the invention is selected according to 
source. In particular, proteins of interest are those present in bacterial species, e.g. 
salmonella, more specific salmonella typhi or salmonella typhimurium, mycobacteria, more 
specific mycobacterium tuberculensis, or staphylococci, more specific staphylococcus 
35 aureus, or from a viral source, e.g. human immunodeficiency virus (HIV), human influenza 
virus, or hepatitis virus. 
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Preferred groups of proteins of interest are, for example, 

receptors, e.g. membrane receptors, in particular 7-TM receptors (GPCRs), receptors with 
enzymatic activity, in particular of a kinase type which might require dimerization to be active, 
5 ion channels, and membrane proteins involved in virus docking and virus entering cells, or 
e.g. intracellular receptors, in particular receptors for compounds crossing the membrane, 
such as receptors for steroid hormones; 

extracellular signaling molecules and signaling factors, e.g. interleukins, growth factors, 
10 releasing hormones, prostaglandins, insulin and glucagon; 

proteins of intracellular signal cascades, e.g. enzymes and cofactors involved in 
phosphatidinyl-inositol signaling, and in cAMP and cGMP generation, membrane adherent 
and free kinases, kinase-kinases as well as phosphatases, and the terminally activated or 
1 5 deactivated enzymes of intracellular signaling cascades, in particular those activating 
caspases; 

hormones, and enzymes involved in the synthesis, liberation, activation, receptor activity and 
desactivation of hormones; 

20 

membrane surface markers correlating with the cell status, e.g. alpha-fetoprotein; 

and proteins involved in blood pressure control and heart function, e.g. ACE inhibitors, 
kidney receptors and kidney channel proteins, and cardiac potassium channel proteins. 

25 

Excluded from the scope of the claims of the present invention are fusion proteins with the 
major head protein D of phage X (gpD), and with protein of interest disclosed in 
PCT/GB02/01636 (WO 02/083937), in particular MHHHHHHSSA-hAGT, the fusion protein of 
the short peptide His 6 further comprising methionine (M), serine (S) and alanine (A), hAGT- 

30 DHFR-HA, the fusion protein of hAGT, a short linker peptide, dihydrofolate reductase from 
mouse and the Ha epitope; V5-NLS-B42-hAGT, the fusion protein of the V5 epitope, the 
SV40 large T antigen nuclear localization sequence, the artificial transcriptional activator 
B42, a linker peptide and hAGT; hAGT-HA-Ura3, the fusion protein of hAGT, the Ha epitope 
and the yeast enzyme orotic acid decarboxylase Ura3; and hAGT-SSN6, the fusion protein of 

35 hAGT, a short linker peptide and a yeast repressor of DNA transcription named SSN6. 
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Disclosed are fusion proteins made from wild-type human AGT (hAGT), other mammalian 
AGT, e.g. rat or mouse AGT, or variants of such AGT DNA on the one side and proteins of 
interest (as listed above) encoding sequences either attached to the N-terminal (N) or the C- 
terminal (C) side or N- and C-terminal side of the AGT DNA sequence, leading to the fusion 
5 proteins of the invention. Fusion proteins may further contain suitable linkers, e.g. linkers 
which may be susceptible to enzyme cleavage under suitable conditions, between AGT and 
the protein of interest and/or between two proteins of interest in a fusion protein. Examples 
of such linkers are those which are cleavable at the DNA stage by suitable restriction 
enzymes, e.g. AGATCT cleavable by Bgl II t and/or linkers cleavable by suitable enzymes at 
10 the protein stage, e.g. tobacco etch virus NIa (TEV) protease. 

Fusion proteins may be expressed in prokaryotic hosts, preferably E colt\ or eukaryotic 
hosts, e.g. eubacteria, yeast, insect cells or mammalian cells. 

15 The O e -alkylguanine-DNA alkyltransferase (AGT) has the property of transferring a label 
present on a substrate to one of the cysteine residues of the AGT forming part of a fusion 
protein. In preferred embodiments, the AGT is a known human 0 6 -alkylguanine-DNA 
alkyltransferase, hAGT. Murine or rat forms of the enzyme are also considered provided 
they have similar properties In reacting with a substrate like human AGT. In the present 

20 invention, O e -alkylguanine-DNA alkyltransferase also includes variants of a wild-type AGT 
which may differ by virtue of one or more, e.g. one, two, three or four, amino acid 
substitutions, deletions or additions, but which still retain the property of transferring a label 
present on a substrate to the AGT part of the fusion protein. AGT variants may be obtained 
by chemical modification using techniques well known to those skilled in the art. AGT 

25 variants may preferably be produced using protein engineering techniques known to the 
skilled person and/or using molecular evolution to generate and select new O e -alkylguanlne- 
DNA alkyltransferases. Such techniques are e.g. saturation mutagenesis, error prone PCR 
to introduce variations anywhere in the sequence, DNA shuffling used after saturation 
mutagenesis and/or error prone PCR, or family shuffling using genes from several species. 

30 

With the aid of the phage display method mutants are found with significantly increased 
activity towards 0 6 -benzylguanine and AGT substrates of the invention. hAGT can be 
functionally displayed as a fusion protein with the major head protein D on phage X, and the 
unusual mechanism of hAGT can be used to select phage X displaying hAGT out of mixtures 
35 of wild-type phage X (Damoiseaux ef a/., ChemBiochem. 4: 285-287, 2001 ). hAGT may also 
be functionally displayed on filamentous phage as a fusion protein with the phage capsid 
protein pill. 
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ln the structure of hAGT bound with O e -benzylguanine in its active site, four amino acids are 
in proximity of either the benzyl ring (Pro140, Ser159 t Gly160), or could make contact with 
the N9 of the nucleobase (Asn157). Mutations at position Pro140 and Gly160 have 
5 previously been shown to affect the reaction of hAGT with O e -benzylguanine (Xu-Welliver et 
a/., Biochemical Pharmacology 58: 1279-85, 1999): A proline at position 140 is believed to 
be essential for its interaction with the benzyl ring, and the mutation Gly160Trp has been 
shown to increase the reactivity of hAGT towards O e -benzylguanine. Particular variants 
considered in this invention are those with Phe or Met in position 140; Gly, Pro, Arg or Trp at 
1 0 position 1 57, in particular Gly; Glu, Asn, Pro or Gin at position 1 59, in particular Glu; and Ala, 
Trp, Cys or Val at position 160, in particular Trp. The preferred variants are the one wherein 
Asn 157 is replaced by Gly and Ser 159 by Glu, and the one wherein Gly 160 is replaced by Ala or 
Trp. Most preferred is the variant wherein Asn 157 is replaced by Ser, Ser 159 by His, and Gly 16C 
by Asn. 

15 

The fusion protein comprising protein of interest and an O e ~alkylguanine-DNA alkyl- 
transferase (AGT) is contacted with a particular substrate having a label. Conditions of 
reaction are selected such that the AGT reacts with the substrate and transfers the label of 
the substrate. Usual conditions are a buffer solution at around pH 7 at room temperature, 
20 e.g. around 25°C. However, it is understood that AGT reacts also under a variety of other 
conditions, and those conditions mentioned here are not limiting the scope of the invention. 

AGT irreversibly transfers the alkyl group from its substrate, O e -alkylguanine-DNA, to one of 
its cysteine residues. A substrate analogue that rapidly reacts with hAGT is 0 6 -benzyl- 
25 guanine, the second order rate constant being approximately 10 3 sec* 1 M" 1 . Substitutions of 
O e -benzylguanine at the C4 of the benzyl ring do not significantly affect the reactivity of 
hAGT against 0 6 -benzylguanine derivatives, and this property has been used to transfer a 
label attached to the C4 of the benzyl ring to AGT. 

30 The label part of the substrate can be chosen by those skilled in the art dependent on the 
application for which the fusion protein is intended. After contacting the fusion protein 
comprising AGT with the substrate, the label is covalently bonded to the fusion protein. The 
labelled AGT fusion protein is then further manipulated and/or detected by virtue of the 
transferred label. 

35 

Under "manipulation" any physical or chemical treatment is understood. For instance 
manipulation may mean isolation from cells, purification with standard purification 
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techniques, e.g. chromatography, reaction with chemical reagents or with the binding partner 
of a binding pair, in particular if the binding partner is fixed to a solid phase, and the like. 



of the labelled fusion protein. If the labelled fusion protein is both manipulated and detected, 
detection may be before or after manipulation, or may occur during manipulation as defined 
herein. 

The particular AGT substrates are compounds of the formula 1 



wherein R r R 2 is a group recognized by AGT as a substrate; 
X is oxygen or sulfur; 

R3 Is an aromatic or a heteroaromatic group, or an optionally substituted unsaturated alkyl, 
cycloalkyl or heterocyclyl group with the double bond connected to CH 2 ; 
R4 is a linker; and 

L is a label, a plurality of same or different labels, a bond connecting R4 to R1 forming a cyclic 
substrate, or a further group -R 3 -CH 2 -X-R r Rz 

In a group RrR 2 , the residue R1 is preferably a heteroaromatic group containing 1 to 5 
nitrogen atoms, recognized by AGT as a substrate, preferably a purine radical of the 
formula 2 



Such manipulation may be dependent on the label L, and may occur in addition to "detection" 




X 





wherein R 2 is hydrogen, alkyl of 1 to 10 carbon atoms, or a saccharide moiety; 
R 5 is hydrogen, halogen, e.g. chloro or bromo, trifluoromethyl, or hydroxy; and 
R 6 is hydrogen, hydroxy or unsubstituted or substituted amino. 
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lf R5 or Re is hydroxy, the purine radical is predominantly present in its tautomeric form 
wherein a nitrogen adjacent to the carbon atom bearing Rs or Re carries a hydrogen atom, 
the double bond between this nitrogen atom and the carbon atom bearing R5 or R 6 is a single 
bond, and R 5 or Re is. double bonded oxygen, respectively. 

5 

A substituted amino group Re is lower alkylamino of 1 to 4 carbon atoms or acylamino, 
wherein the acyl group is lower alkylcarbonyl with 1 to 5 carbon atoms, e.g. acetyl, propionyl, 
n- or isopropylcarbonyl, or n-, iso- or tert-butylcarbonyl, or arylcarbonyl, e.g. benzoyl. 

10 If Re is unsubstituted or substituted amino and the residue X connected to the bond of the 
purine radical is oxygen, the residue of formula 2 is a guanine derivative. 

A saccharide moiety R 2 is a saccharide monomer or oligomer connected with a spacer of 
variable length to the N 9 position of the guanine base. The spacer in this context is an alkyl 
15 chain preferably from 1 to 15 carbon atoms, a polyethylene glycol spacer consisting of 1 to 
200 ethylene glycol units, an amide group -CO-NH-, an ester group -COO-, an alkylene 
group -CH=CH- or a combination of alkyl chain, polyethylene glycol group, amide group, 
ester group, and alkylene group. 

20 In the context of this invention, a saccharide moiety R 2 further includes a p-D-2'-deoxyribosyl, 
or a p-D-2-deoxyribosyl being incorporated into a single stranded oligodeoxyribonucleotide 
having a length of 2 to 99 nucleotides, wherein the guanine derivative R1 occupies any 
position within the oligonucleotide sequence. 

25 In another preferred embodiment of the invention the group R r R 2 is a 8-azapurine radical, 
wherein the moiety OR 5 of the radical of formula 2 is replaced by nitrogen, and R 2 and Re 
have the meaning as defined under formula 2. 

X is preferably oxygen. 

30 

R3 as an aromatic or a heteroaromatic group, or an optionally substituted unsaturated alkyl, 
cycloalkyl or heterocyclyl group is a group sterically and electronically accepted by AGT (in 
accordance with its reaction mechanism) which allows the covalent transfer of the R3-R4-L 
unit to the fusion protein. In a R3-R4-L unit, R4-L may also have the meaning of a plurality of 
35 same or different linkers R4 carrying a plurality of same or different labels L 



WO 2004/031404 PCT/EP2003/0 10859 

-10- 

R 3 as an aromatic group is preferably phenyl or naphthyl, in particular phenyl, e.g. phenyl 
substituted by R4 in para or meta position. 

A heteroaramatic group R 3 is a mono- or bicyclic heteroaryl group comprising zero, one, two, 
5 three or four ring nitrogen atoms and zero or one oxygen atom and zero or one sulfur atom, 
with the proviso that at least one ring carbon atom is replaced by a nitrogen, oxygen or sulfur 
atom, and which has 5 to 12, preferably 5 or 6 ring atoms; and which in addition to carrying a 
substituent R4 may be unsubstituted or substituted by one or more, especially one, further 
substituent selected from the group consisting of lower alkyl, such as methyl, lower alkoxy, 

10 such as methoxy or ethoxy, halogen, e.g. chlorine, bromine or fluorine, halogenated lower 
alkyl, such as trifluoromethyl, or hydroxy. Preferably the heteroaryl group R 3 is triazolyl, 
especially 1 -triazolyl, carrying the further substituent R4 in the 4- or 5-position, tetrazolyl, 
especially 1 -tetrazolyl, carrying the further substituent R4 in the 4- or 5-position or 2-tetrazolyl 
carrying the further substituent in 5 position, isoxazolyl, especially 3-isoxazolyl carrying the 

1 5 further substituent in 5 position, or 5-isoxazolyl, carrying the further substituent in 3 position, 
or thienyl, especially 2-thienyl, carrying the further substituent R4 in 3-, 4- or 5-position, 
preferably 4-position, or 3-thienyl, carrying the further substituent R 4 in 4-position. 

An optionally substituted unsaturated alkyl group R 3 is 1-alkenyl carrying the further 
20 substituent R4 in 1 or 2 position, preferably in 2 position, or 1-alkynyl. Substituents 

considered in 1-alkenyl are e.g. lower alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower 
acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro. 

An optionally substituted unsaturated cycloalkyl group is a cycloalkyl group with 3 to 7 carbon 
25 atoms unsaturated in 1 position, e.g. 1-cyclopentyl or 1-cyclohexyl, carrying the further 

substituent R4 in any position. Substituents considered are e.g. lower alkyl, e.g. methyl, lower 
alkoxy, e.g. methoxy, lower acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro. 

An optionally substituted unsaturated heterocyclyl group has 3 to 12 atoms, 1 to 5 
30 heteroatoms selected from nitrogen, oxygen and sulfur, and a double bond in the position 

connecting the heterocyclyl group to methylene CH 2 . Substituents considered are e.g. lower 

alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower acyloxy, e.g. acetoxy, or halogenyl, e.g. 

chloro. In particular, an optionally substituted unsaturated heterocyclyl group Is a partially 

saturated heteroaramatic group as defined hereinbefore for a heteroaramatic group R 3 . An 
35 example of such a heterocyclyl group is isoxazolidinyl, especially 3-isoxazolidinyl carrying the 

further substituent in 5 position, or 5-isoxazolidinyl, carrying the further substituent in 3 

position. 
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A linker group R4 is preferably a flexible linker connecting a label L or a plurality of same or 
different labels L to the substrate. Linker units are chosen in the context of the envisioned 
application, i.e. in the transfer of the substrate to a fusion protein comprising AGT. They also 
5 increase the solubility of the substrate in the appropriate solvent. The linkers used are 

chemically stable under the conditions of the actual application. The linker does not interfere 
with the reaction with AGT nor with the detection of the label L, but may be constructed such 
as to be cleaved at some point in time after the reaction of the compound of formula 1 with 
the fusion protein comprising AGT. 

10 

A linker R4 is a straight or branched chain alkylene group with 1 to 300 carbon atoms, 
wherein optionally 

(a) one or more carbon atoms are replaced by oxygen, in particular wherein every third 
carbon atom is replaced by oxygen, e.g. a poylethyleneoxy group with 1 to 100 ethyleneoxy 

15 units; 

(b) one or more carbon atoms are replaced by nitrogen carrying a hydrogen atom, and the 
adjacent carbon atoms are substituted by oxo, representing an amide function -NH-CO-; 

(c) one or more carbon atoms are replaced by oxygen, and the adjacent carbon atoms are 
substituted by oxo, representing an ester function -O-CO-; 

20 (d) the bond between two adjacent carbon atoms is a double or a triple bond, representing a 
function -CH=CH- or -CEC-; 

(e) one or more carbon atoms are replaced by a phenylene, a saturated or unsaturated 
cycloalkylene, a saturated or unsaturated bicycloalkylene, a bridging heteroaromatic or a 
bridging saturated or unsaturated heterocyclyl group; 
25 (f) two adjacent carbon atoms are replaced by a disulfide linkage -S-S-; 

or a combination of two or more, especially two or three, alkylene and/or modified alkylene 
groups as defined under (a) to (f) hereinbefore, optionally containing substituents. 

Substituents considered are e.g. lower alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower 
30 acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro. 

Further substituents considered are e.g. those obtained when an a-amino acid, in particular a 
naturally occurring a-amino acid, is incorporated in the linker R4 wherein carbon atoms are 
replaced by amide functions -NH-CO- as defined under (b). In such a linker part of the 
35 carbon chain of the alkylene group R4 is replaced by a group -(NH-CHR-CO) n - wherein n is 
between 1 and 100 and R represents a varying residue of an a-amino acid. 



WO 2004/031404 



PCT/EP2003/010859 



-12- 

A further substituent is one which leads to a photocleavable linker e.g. an o-nitrophenyl 
group. In particular this substituent o-nitrophenyl is located at a carbon atom adjacent to an 
amide bond, e.g. In a group -NH-CO-CH 2 -CH(o-nitrophenyl)-NH-CO-, or as a substituent in 
a polyethylene glycol chain, e.g. in a group -0-CH 2 -CH(o-nitrophenyl)-0-. Other 
5 photocleavable linkers considered are, e.g. phenacyl, alkoxybenzoin, benzylthioether and 
pivaloyl glycol derivatives. 

A phenylene group replacing carbon atoms as defined under (e) hereinbefore is e.g. 1,2-, 
1,3-, or preferably 1,4-phenylene. A saturated or unsaturated cycloalkylene group replacing 

10 carbon atoms as defined under (e) hereinbefore is e.g. cyclopentylene or cyclohexylene, or 
also cyclohexylene being unsaturated e.g. in 1- or in 2-position. A saturated or unsaturated 
bicycloalkylene group replacing carbon atoms as defined under (e) hereinbefore is e.g. 
bicyclo[2.2.1]heptylene or bicyclo[2.2.2]octylene, optionally unsaturated in 2-position or 
doubly unsaturated in 2- and 5-position. A heteroaromatic group replacing carbon atoms as 

1 5 defined under (e) hereinbefore is e.g. triazolidene, preferably 1 ,4-triazolidene, or 

isoxazolidene, preferably 3,5-isoxazolidene. A saturated or unsaturated heterocyclyl group 
replacing carbon atoms as defined under (e) hereinbefore is e.g. 2,5- tetrahydrofuranediyl or 
2,5-dioxanediyl, or isoxazolidinene, preferably 3,5-isoxazolidinene. A particular heterocyclyl 
group considered is a saccharide moiety, e.g. an a- or p-furanosyl or a- or p-pyranosyl 

20 moiety. 

A linker R 4 may carry one or more same or different labels, e.g. 1 to 100 same or different 
labels, in particular 1 to 5, preferably one, two or three, in particular one or two same or 
different labels. 

25 

The label part L of the substrate can be chosen by those skilled in the art dependent on the 
application for which the fusion protein is intended. Labels may be e.g. such that the labelled 
fusion protein is easily detected or separated from its environment. Other labels considered 
are those which are capable of sensing and inducing changes in the environment of the 
30 labelled fusion protein and/or labels which aid in manipulating the fusion protein by the 

physical and/or chemical properties specifically introduced by the label to the fusion protein. 

Examples of labels L include a spectroscopic probe such as a fluorophore, a chromophore, a 
magnetic probe or a contrast reagent; a radioactively labelled molecule; a molecule which is 
35 one part of a specific binding pair which is capable of specifically binding to a partner; a 

molecule that is suspected to interact with other biomolecules; a library of molecules that are 
suspected to interact with other biomolecules; a molecule which is capable of crosslinking to 
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other molecules; a molecule which is capable of generating hydroxyl radicals upon exposure 
to H 2 0 2 and ascorbate, such as a tethered metal-chelate; a molecule which is capable of 
generating reactive radicals upon irradiation with light, such as malachite green; a molecule 
covalently attached to a solid support, where the support may be a glass slide, a microtiter 
5 plate or any polymer known to those proficient in the art; a nucleic acid or a derivative thereof 
capable of undergoing base-pairing with its complementary strand; a lipid or other 
hydrophobic molecule with membrane-inserting properties; a biomolecule with desirable 
enzymatic, chemical or physical properties; or a molecule possessing a combination of any 
of the properties listed above. 

10 

When the label L is a fluorophore, a chromophore, a magnetic label, a radioactive label or 
the like, detection is by standard means adapted to the label and whether the method is used 
in vitro or in vivo. The method can be compared to the applications of the green fluorescent 
protein (GFP) which is genetically fused to a protein of interest and allows protein 
1 5 investigation in the living cell. Particular examples of labels L are also boron compounds 
displaying non-linear optical properties, or a member of a FRET pair which changes its 
spectroscopic properties on reaction of the labelled substrate with the AGT fusion protein. 

Depending on the properties of the label L, the fusion protein comprising protein of interest 

20 and AGT may be bound to a solid support. The label of the substrate reacting with the fusion 
protein comprising AGT may already be attached to a solid support when entering into 
reaction with AGT, or may subsequently, i.e. after transfer to AGT, be used to attach the 
AGT fusion protein to a solid support. The label may be one member of a specific binding 
pair, the other member of which is attached or attachable to the solid support, either 

25 covalently or by any other means. A specific binding pair considered Is e.g. biotin and avidin 
or streptavidin. Either member of the binding pair may be the label L of the substrate, the 
other being attached to the solid support. Further examples of labels allowing convenient 
binding to a solid support are e.g. maltose binding protein, glycoproteins, FLAG tags, or 
reactive substituents allowing chemoselective reaction between such substituent with a 

30 complementary functional group on the surface of the solid support. Examples of such pairs 
of reactive substituents and complementary functional group are e.g. amine and activated 
carboxy group forming an amide, azide and a propiolic acid derivative undergoing a 1,3- 
dipolar cycloaddition reaction, amine and another amine functional group reacting with an 
added Afunctional linker reagent of the type of activated bis-dicarboxylic acid derivative 

35 giving rise to two amide bonds, or other combinations known in the art. 
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Examples of a convenient solid support are e.g. chemically modified oxidic surfaces, e.g. 
silicon dioxide, tantalum pentoxide, titanium dioxide, glass surfaces, e.g. glass slides, 
polymer surfaces, e.g. microtiter plates, in particular Hinctionalised polymers (e.g. in the form 
of beads), or also chemically modified metal surfaces, e.g. noble metal surfaces such as gold 
5 or silver surfaces, and suitable sensor elements made of any of the aforementioned 

materials. Irreversibly attaching and/or spotting AGT substrates may then be used to attach 
AGT fusion proteins in a spatially resolved manner, particularly through spotting, on the solid 
support representing protein microarrays, DNA microarrays or arrays of small molecules. 

10 When the label L is capable of generating reactive radicals, such as hydroxyl radicals, upon 
exposure to an external stimulus, the generated radicals can then inactivate the AGT fusion 
proteins as well as those proteins that are in close proximity of the AGT fusion protein, 
allowing to study the role of these proteins. Examples of such labels are tethered metal- 
chelate complexes that produce hydroxyl radicals upon exposure to H 2 0 2 and ascorbate, and 

15 chromophores such as malachite green that produce hydroxyl radicals upon laser irradiation. 
The use of chromophores and lasers to generate hydroxyl radicals is also known in the art as 
chromophore assisted laser induced inactivation (CALI). In the present invention, labelling 
AGT fusion proteins with chromophores such as malachite green and subsequent laser 
irradiation inactivates the AGT fusion protein as well as those proteins that interact with the 

20 AGT fusion protein in a time-controlled and spatially-resolved manner. This method can be 
applied both in vivo or in vitro. Furthermore, proteins which are in close proximity of the AGT 
fusion protein can be identified as such by either detecting fragments of that protein by a 
specific antibody, by the disappearance of those proteins on a high-resolution 2D- 
eiectrophoresis gels or by identification of the cleaved protein fragments via separation and 
. 25 sequencing techniques such as mass spectrometry or protein sequencing by N-terminal 
degradation. 

When the label L is a molecule that can cross-link to other proteins, e.g. a molecule 
containing functional groups such as maleimides, active esters or azides and others known 

30 to those proficient in the art, contacting such labelled AGT substrates with AGT fusion 

proteins that interact with other proteins {in vivo or in vitro) leads to the covalent cross-linking 
of the AGT fusion protein with its interacting protein via the label. This allows the 
identification of the protein interacting with the AGT fusion protein. Labels L for photo cross- 
linking are e.g. benzophenones. In a special aspect of cross-linking the label Lis a molecule 

35 which is itself an AGT substrate leading to dimerization of the AGT fusion protein. The 

chemical structure of such dimers may be either symmetrical (homodimers) or unsymmetrical 
(heterodimers). 
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Other labels L considered are for example fullerenes, boranes for neutron capture treatment, 
nucleotides or oligonucleotides, e.g. for self-addressing chips, peptide nucleic acids, and 
metal chelates, e.g. platinum chelates that bind specifically to DNA. 

5 

If the substrate carries two or more labels, these labels may be identical or different. 

The present invention provides a method to label AGT fusion proteins both in vivo as well as 
in vitro. The term in vivo labelling of a AGT fusion protein includes labelling in all 
1 0 compartments of a cell as well as of AGT fusion proteins pointing to the extracellular space. 
If the labelling of the AGT fusion protein is done in vivo and the protein fused to the AGT is a 
membrane protein, more specifically a plasma membrane protein, the AGT part of the fusion 
protein can be attached to either side of the membrane, e.g. attached to the cytoplasmic or 
the extracellular side of the plasma membrane. 

15 

If the labelling is done in vitro, the labelling of the fusion protein can be either performed in 
cell extracts or with purified or enriched forms of the AGT fusion protein. 

If the labelling is done in vivo or In cell extracts, the labelling of the endogenous AGT of the 
20 host is advantageously taken into account. If the endogenous AGT of the host does not 
accept O e -alkylguanine derivatives or related compounds as a substrate, the labelling of the 
fusion protein is specific. In mammalian cells, e.g. in human, murine, or rat cells, labelling of 
endogenous AGT is possible. In those experiments where the simultaneous labelling of the 
endogenous AGT as well as of the AGT fusion protein poses a problem, known AGT- 
25 deficient cell lines can be used. 

In a particular aspect, the present invention provides a method of determining the interaction 
of a candidate compound or library of candidate compounds with a target protein or library of 
target proteins. Examples of candidate compounds and target proteins include ligands and 

30 proteins, drugs and targets of the drug, or small molecules and proteins. In this particular 
method of the invention, the protein of interest fused to the AGT comprises a DNA binding 
domain of a transcription factor or an activation domain of a transcription factor. The putative 
protein target of the substances or library of proteins is linked to either of the DNA binding 
domain or the activation domain of the transcription factor in a way that a functional 

35 transcription factor can be formed, and the label L of the AGT substrate according to the 
invention is a candidate compound or library of candidate compounds suspected of 
interacting with the target substance or substances. The candidate compound or library of 
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candidate compounds being part of the substrate is then transferred to the AGT fusion 
protein. On transfer the AGT fusion protein(s) comprising the target substance(s) now are 
labelled with the candidate compound(s). The interaction of a candidate compound joined to 
the AGT fusion protein with the target protein fused to either the DNA binding domain or the 
5 activation domain leads to the formation of a functional transcription factor. The activated 
transcription factor can then drive the expression of a reporter which, if the method is carried 
out in cells, can be detected if the expression of the reporter confers a selective advantage 
on the cells. In particular embodiments, the method may involve one or more further steps 
such as detecting, isolating, identifying or characterising the candidate compound(s) or target 
10 substance(s). 

In a specific example the label L is a drug or a biological active small molecule that binds to a 
yet unidentified protein Y. A cDNA library of the organism which is expected to express the 
unknown target protein Y is fused to the activation domain of a transcription factor, and the 

1 5 AGT is fused to the DNA binding domain of a transcription factor, or alternatively, the cDNA 
library expected to express the unknown target protein Y is fused to the DNA binding domain 
of a transcription factor, and the AGT is fused to the activation domain of a transcription 
factor. Adding the AGT substrate of the invention comprising such a label L leads to the 
formation of a functional transcription factor and gene expression only in the case where this 

20 molecule binds to its target protein Y present in the cDNA library and fused to the activation 
domain or binding domain, respectively. If gene expression is coupled to a selective 
advantage, the corresponding host carrying the plasmid with the gene coding for the target 
protein Y of the drug or bioactive molecule can be identified. 

25 In a further specific example the label L is a library of chemical molecules. The library is 
expected to contain yet unidentified compounds that bind to a known drug target protein Y 
under in vivo conditions. The target protein Y is fused to the activation domain of a 
transcription factor and the AGT is fused to the DNA binding domain of a transcription factor, 
or alternatively, the target protein Y is fused to the DNA binding domain of a transcription 

30 factor and the AGT is fused to the activation domain of a transcription factor. Adding the 
substrate carrying the library of chemical compounds will lead to the covalent attachment of 
the chemical compounds of the library to the AGT, which is fused to either the DNA binding 
domain of a transcription factor or to the activation domain of a transcription factor, 
respectively. Interaction between a compound of the library (representing the label) attached 

35 to the AGT fusion protein and the target protein Y leads to the formation of a functional 
transcription factor and gene expression only in the case where the compound in the 
chemical library, linked through the covalent AGT-substrate bond, to either the DNA binding 
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domain of a transcription factor or to the activation domain of a transcription factor, binds to 
the target protein Y fused to the activation domain of a transcription factor or the DNA 
binding domain of a transcription factor, respectively. If gene expression is coupled to a 
selective advantage, those molecules of the library leading to the growth of the host can be 
5 identified. 

In the case where L is a bond connecting to Ri forming a cyclic substrate, a preferred 
compound is the cyclic substrate wherein the bond from R4 to R1 is a bond connecting the 
linker R4 to an amino group Re as defined under formula 2. In such a preferred cyclic 
10 substrate R 2 is preferably an oligonucleotide, i.e. a p-D-2-deoxyribosyl being incorporated 
into a single stranded oligodeoxyribonucleotide having a length of 2 to 99 nucleotides as 
detailed above. This oligonucleotide may be further chemically modified so that it can be 
detected and functions therefore as a label. The chemical modification of substituents might 
be of the same nature as mentioned above for the label L 

15 

In the case where L is a further group -R3-CHrX-R r R2, the substrate is a dimeric compound 
leading to a dimerised fusion protein on reaction with a fusion protein comprising AGT. 

Examples 

20 

Example 1 : Glutathion S-transferase (C) hAGT fusion protein 

hAGT is cloned between the BamH1 and EcoR1 sites of the expression vector pGEX2T 
(Pharmacia). Protein expression is carried out in E. coli strain JM83. An exponentially 
growing culture is induced with 1mM IPTG and the expression is carried out for 3.5 h at 

25 24°C. The harvested cells are resuspended in PBS supplemented with 1mM PMSF and 2 
pg/mL aprotinin and disrupted by lysozyme and sonification. To get rid of DNA, MgCI 2 is 
adjusted to 1mM and DNAse I is added to a concentration of 0.01 mg/mL The mixture is 
allowed to stand on ice for 30 min before cell debris are separated by centrifugation at 
40000 x g. The extract is applied to equilibrated glutathion sepharose which is then washed 

30 with one bed volume Tris HCI pH 8.5 and with 20 bed volumes PBS. GST-hAGT fusion 
protein is then eluted with 10 mM reduced glutathione in 50 mM TrisHCI pH 7.9. The 
purified protein is dialyzed against 50 mM HEPES pH 7.2; 1 mM DTT; 30 % glycerol and 
then stored at -B0°C. Purified GST-hAGT is incubated in vitro with O e -benzylguanine 
(Sigma) or 0 8 -4-bromothenylguanine. In a total reaction volume of 90 \xL, 0.4 jxM GST-hAGT 

35 are incubated with 2 jiM substrate in 50 mM HEPES pH 7.2; 1 mM DTT at room temperature. 
At several points of time an aliquot is quenched with 8.5 pmol 0 6 -benzylguanineoligo- 
nucleotide which is linked to a biotin group via the O 6 position (R. Damoiseaux et a/., Chem 
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Biochem 4: 285, 2001) for 10 min and mixed with SDS-Laemmli buffer for Western blotting 
analysis (neutravidin-peroxidase conjugate (PIERCE), Renaissance reagent plus (NEN)). 
The intensity of the corresponding bands is quantified by a Kodak Image Station 440. 

5 Exam ple 2: Orotldlne-ff-phosphate decarboxylase Ura3 (C ) hAGT fusion protein 

A plasmid is used which is based on the yeast shuttle vector pRS314 (Sikorski and Hieter, 
Genetics 122: 19-27, 1999). Between the BamH1 and EcoR1 restriction sites of pRS314 a 
copper inducible promoter (CU-promoter) is inserted. The Ura3 gene (with an N terminal 
HA-tag) is inserted between the Bglll and Kpnl sites, and hAGT is inserted between the 

10 EcoR1 and Bglll sites leading to a hAGT-Ura3 fusion protein. 

Expression levels of the hAGT-Ura3 fusion protein are monitored by inducing 5 mL of a 
culture with an OD 60 o of 0.3 with 0.1 mM CuS0 4 and incubating the culture for 3 h. 3 mL of 
the culture are harvested by centrifugation, resuspended in 50 pL 2 x Laemmli buffer and 
disrupted by 3 freeze-thaw cycles. Samples are loaded to a SDS-PAGE and Western 

1 5 blotting is performed (mouse HA.1 1 antibody (BABCO); peroxidase conjugated anti mouse 
antibody A4416 (Sigma); Renaissance reagent plus (NEN)). 

Activity of Ura3 is determined by growing transformants on plates containing CuS0 4 and 
lacking uracil. The activity of hAGT-Ura3 fusion protein is determined by an ELISA: 50 mL 
CM medium are supplemented with 0.1 mM CuSO* and 100 pM O e -benzylguanine, and 

20 inoculated with 5 mL of a stationary grown overnight culture. Protein expression is carried 
out for approximately 5 hours until the ODeoo reaches 1 .0. The harvested cells are 
resuspended in yeast lysis buffer (50 mM HEPES pH 7.5; 150 mM NaCI; 5 mM EDTA; 1% 
TX100; 1 mM DTT; 1 mM PMSF; 2 pg/mL aprotinin) and disrupted by 3 freeze-thaw cycles. 
300 pL of the resulting extract are incubated for 20 min with 5 pmol O e -benzylguanine- 

25 oligonucleotide which is linked to a biotin group via the O 6 position (R. Damoiseaux ef a/., 
ChemBiochem 4: 285, 2001), and then coated for 1 h to a previously blocked StreptaWell 
plate (Boehringer Mannheim). The ELISA is then developed with standard methods 
(detection by HA.1 1 and A4416 antibodies; development with peroxidase substrate ABTS 
(1 .0 mg/mL ABTS, 0,01% H 2 0 2 in 100 mM sodium citrate); readout at 405 nm). 

30 

Example 3: Ubiauitin (N) Ura3 (C) hAGT fusion protein 

To generate a hAGT with an N- terminal arginlne a linear ubiquitin-hAGT fusion protein is 
constructed by PGR where the construct is flanked with EcoR1 and Bglll restriction sites. 
The construct is inserted between the EcoR1 and Bglll sites of the construct hAGT-Ura3 
35 described in Example 2 leading to an ubiquitin-hAGT-Ura3 fusion protein. 

Expression levels of the ubiquitin-hAGT-Ura3 fusion protein and activity of the fusion protein 
obtained is monitored as described for hAGT-Ura3 in Example 2. 
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Example 4: Tud1 (N) w160 hAGT fusion protein 

Tup1 is involved in glucose repression of transcription (F.E. Williams and R. Trumbly, Mol 
Cell Biol 10: 6500-1 1 , 1990). This nuclear localized protein is fused to the N-terminus of 
5 w160 hAGT by the linker DHGSG, which contains the cloning site Nco I and connects the last 
amino acid Asn of Tup1 with the first amino acid Met of hAGT. For antibody detection the 
epitope HA is directly fused to the C-terminus of hAGT, followed by the stop codon. The 
primers for the cloning are ak121 (N, Tup1): 

5 , -GCATGAATTCATGACTGCCAGCGTTTCG-3 , (SEQ ID No. 1), ak122 (C, Tup1): 
1 0 5-GGATCCCCATGGTCATTTGGCGCTA I I I I I I IATAC-3' (SEQ ID No. 2), ak1 25 

(N, hAGT): S-CGTGACCATGGGAGTGGGATGGACAAGGATTGTGAAATG-S 1 (SEQ ID 
No. 3) and ak132 (C, HA): 5 , -GCATGGGTACCTTAAGCGTAATCTGGAACATCG-3 , (SEQ ID 
No. 4). A culture of L40 yeast cells, containing the expression vector p314AK1 in which the 
Tup1- Wi60 hAGT protein is under control of the p cup1 promoter, is grown to an OD^o of 0.6. 
1 5 Expression of Tup1 - w160 hAGT is induced by adding CuS0 4 to a concentration of 1 00 jiM and 
the cell culture is incubated for 2.5 h. After lysis of the yeast cells by freeze/thaw cycling the 
cell extract is analyzed for the presence of expressed Tup1- w160 hAGT fusion protein using 
Western Blotting (1. antiHA-antibody (Babco), 2. antimouse-peroxidase conjugate (Sigma)). 
The activity is verified by fluorescence microscopy, when the nuclear fusion protein is labeled 
20 with BGAF (0 6 -(p-aminomethyl)benzylguanine carrying a diacetate of 5(6)-carboxy- 
fluorescein residue connected by an amide bond to the p-aminomethyl group) in vivo. 

BGAF is prepared in the following way: 

6.0 mg (0.022 mmol) of 0 6 -(4-aminomethyl-benzyl)guanine are dissolved in 2 mL dry DMF 
25 (40°C, sonicated for 30 min) under argon atmosphere. After cooling to room temperature 
4.6 pL triethylamine (0.033 mmol) and 14.8 mg (0.027 mmol) of 5(6)-carboxyfIuorescein A/- 
succinimidyi ester (mixture of isomers) are added. After stirring 1 h at room temperature the 
solvent is removed and the products are purified by flash column chromatography using a 
stepwise gradient of methanol in dichloromethane (1:20, 1:10, 1 :5). Under these conditions 
30 both BGAF and the hydrolyzed derivative of BGAF (termed BGFL) are isolated, and are each 
dissolved in 400 pJL DMSO. The concentration of the solution of BGFL is determined by the 
absorption at X = 492 nm via the extinction coefficient of fluorescein (e^ = 98.4 x 10 3 M" 1 cm" 1 
at pH 7.4). The concentration of BGFL is calculated as 4.4 mM. Yield: 1.11 mg (0.0018 
mmol, 8 %). R, = 0.02 (methanol/dichloromethane 1/10). MS(ESI) 629.27 (100 [M+H] + ). 
35 C34H24N6O7 M = 628.61 g/mol. The concentration of BGAF is determined by the absorption 
at X = 280 nm using the added extinction coefficients of O e -(4-aminomethyl-benzyl)guanine 
and fluorescein (e^o = (7.1 + 53.3) mM"W 1 = 60.4 mM' 1 cm' 1 ). The concentration of BGAF 
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is calculated as 0.8 mM. Yield: 0.23 mg (0.3 nmol, 1.5 %). R, = 0.38 (methanol/dichloro- 
methane1/10). MS(ESI) 713.35 (100 [M+H] + ). C38H 28 Ne0 9 M = 712.68 g/mol. 

Example 5: Tup1 (H) Enhanced cyano fluorescent protein ECFP (C) w160 hAGT fusion protein 
5 Tup1 is fused to the N-terminus of w160 hAGT by the linker DHGSG as described in Example 
4. However, the epitope HA fused to the C-terminus of hAGT is followed by the fluorescent 
protein ECFP. The primers for the cloning are ak121 (N, Tup1) (SEQ ID No. 1), ak122 (C, 
Tup1) (SEQ ID No. 2), ak125 (N, hAGT) (SEQ ID No. 3), ak126 (ECFP. HA): 
S' -CTCGCCCTTGCTCACCAT CCCGCTGCCGGACnnAnnfiTAATHTCnAACATCG^- 
10 (SEQ ID No. 5), ak127 (ECFP . HA): 

5 > -CGATGTTCCAGATTACGCTGGGTCCGGCAGCGGG ATGGTGAGCAAGGGCGAG- 3 , 
(SEQ ID No. 6) and ak128 (C, ECFP): 

S-CTAGCTGGGTACCGTT ACTTGTACAGCTCGTCCATGA -S' (SEQ ID No. 7). 
A culture of L40 yeast cells, containing the expression vector p314AK1 in which the 

15 Tup 1- w160 hAGT-ECFP protein is under control of the Pcupi promoter, is grown to an OD 6 oo of 
0.6. Expression of Tup 1- w160 hAGT-ECFP is induced by adding CuS0 4 to a concentration of 
100 jxM and the cell culture is incubated for 2.5 h. After lysis of the yeast cells by 
freeze/thaw cycling the cell extract is analyzed for the presence of expressed 
Tup 1- w160 hAGT-ECFP fusion protein using Western Blotting (1. antiHA-antibody (Babco), 

20 2. antimouse-peroxidase conjugate (Sigma)). The activity is verified by fluorescence 

microscopy, when the nuclear fusion protein is labeled with BGAF in vivo and the nucleus is 
distinguished from the residual cell. 

Example 6: LexA (C^ hAGT fusion protein 

25 LexA is the DNA-binding domain of an E coli transcription regulator used in the yeast-two 
hybrid approach. The hAGT is fused to its C-terminus, in-between the restriction sites EcoR 
/ and Not I of the yeast expression vector pHybLexZeo (Invitrogen). The primers used are 
ak101 (N, hAGT): 5 , -CGATACGAATTCATGGACAAGGATTGTGAAATGAAACGC-3 , (SEQ 
ID No. 8), and ak102 (C, hAGT): 5-TTCATAGCGGCCGCGTCAGTTTCGGCCAGCAGGC-3' 

30 (SEQ ID No. 9). 

Example 7: Cytochrome C peroxidase CCP (C) hAGT fusion protein 
In the hAGT-Ura3 construct (Example 2) Ura3 is replaced by CCP (without its mitochondrial 
targeting sequence) carrying the mutations D217P and D224Y (Iffland ef a/., Biochem 
35 Biophys Res Commun 286: 126-132, 2001). To test the activity of CCP as afusion protein, 
yeast colonies transformed with the vector leading to expression of hAGT-CCP are 
transferred to nitrocellulose and (after 3 freeze-thaw cycles) exposed to 5 or 20 mM ABTS in 
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50 mM KH 2 P0 4 buffer containing 0.02 % H 2 0 2 . The colonies stained dark green within 
minutes whereas colonies not expressing the protein only stained very faintly. 

Example 8: Enhanced cyano fluorescent protein ECFP (C) w160 hAGT fusion protein 
5 The fluorescent protein ECFP is fused to the C-terminus of w160 hAGT, followed by a stop 
codon. The fusion by PCR is performed with the same primers as for the fusion protein 
Tup 1- w160 hAGT-ECFP (Example 5). The protein w160 hAGT-ECFP is incorporated into the 
mammalian expression vector pNuc (Clontech) between the restriction sites Nhe I and 
BamHI. 

10 CHO cells deficient in AGT are transfected with a vector encoding w160 hAGT-ECFP. After 
24 h of transient expression, cells grown on 0.18 mm thick glass slides are transferred to a 
perfusion chamber and incubated with BGFL (5 nM) for 5 min. Cells are washed three times 
with PBS buffer to remove excess substrate. For the fluorescence measurements a Zeiss 
LSM510 laser scanning confocal microscope is used (Cart Zeiss AG). Detection of 

15 fluorescein or ECFP signals (excitation at 488 nm) is achieved by appropriate filters. 

Scanning speed and laser intensity are adjusted to avoid photobleaching of the fluorescent 
probes, and damage or morphological changes of the cells. 



Example 9: Membrane protein of the ER Sec62 / DHFR (C) hAGT fusion protein 

20 Fragments encoding the ORF (open reading frame) of the N-terminal domain of the protein 
Sec62p, the full-length ORF of the peroxisomal membrane proteins Pex10p and Pex15p, and 
the ORF of an N-terminal fragment of the yeast casein kinase (YCK1) are obtained by PCR 
using yeast genomic DNA as a template and an oligonucleotide primer complementary to the 
5' and 3' ends of the desired DNA fragments respectively. All 5 -primers contain an 

25 additional BamHI site and all 3-primers an additional restriction site to allow for the in-frame 
fusion 3* to the CUP 1 -hAGT module on a pRS31 4 vector or for the DNA fragment of YCK1 
on a pRS304 vector. The ORF of the N-terminal domain of the protein Sec62p is inserted in 
frame between the CUP1-hAGT module and the sequence encoding the mouse 
dihydrofolate reductase (DHFR) that is extended by an additional sequence encoding for the 

30 HA epitope tag (Dha). The CUP1-hAGT module is obtained by PCR using a plasmid DNA 
containing the full length AGT as a template and an oligonucleotide primer complementary to 
the 5' and 3' ends of the ORF of hAGT. The 3 -primers contain an additional BamHI site and 
the 5 -primer an additional EcoRI site to allow for the fusion 3' to the yeast CUP1 promotor on 
a pRS314 and pRS304 vector. The plasmids CUP1-hAGT-SEC62-314, CUP1-hAGT- 

35 PEX10-314 and CUP1-hAGT-PEX15-314 are transformed into yeast cells. The presence of 
the plasmids are controlled by the growth on selective media lacking tryptophan. To obtain 
the full length version of the hAGT-YCK1 fusion gene, CUP1 -hAGT-YCK1 -304 is cut with 
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SaH to allow for homologous recombination with the chromosomal YCK1 after 
transformation of the cut plasmid into yeast. Successful recombination is verified by 
diagnostic PCR using the appropriate oligonucleotides as primers. 
Functional assay of the hAGT-Sec62-Dha fusion protein: 100 mL of S. cerevisiae cells 
5 expressing hAGT-Sec62-Dha are grown at 30°C to an ODeoo of -0.5 and supplemented with 
100 yM CuS0 4 4 hours prior to cell extraction. After centrifugation the cells are opened by 
grinding in fluid nitrogen and the proteins are extracted in buffer containing 150 Mm NaCI, 
20 mM HEPES pH 7.5 , 1 mM EDTA and a protease inhibitor cocktail (Boehringer 
Mannheim, Germany). After a 15 min centrifugation at 20.000 rpm at 4°C, the cleared 

10 extracts are treated with 10 pmol of an oligonucleotide containing the substrate BGBT for 20 
min at room temperature. The cells extracts are incubated with 15 pL of Dynabeads for 4 
hours and the beads are washed five times with 1 ml of extraction buffer. The washed beads 
are boiled in 30 \il of Laemmli buffer and the extract is subjected to SDS PAGE. The 
purified hAGT-Sec62-Dha is detected after Western blotting onto nitrocellulose by 

15 consecutive incubation with mouse monoclonal HA antibody and horseradish peroxidase- 
coupled rabbit anti-mouse antibody. 

Example 10: Serotonin receptor 5-HT3 (N) hAGT fusion protein 

The vector pEAK8-5HT 3 R containing the serotonin receptor 5-HT 3 (mouse) was provided by 
20 the group of H. Vogel (EPFL Lausanne, Switzerland). ^^hAGT is incorporated into the 
fourth loop (cytoplasmatic) of the receptor between the restriction sites SnaB I and Pac /, 
which had been introduced by mutagenesis. The primers for the amplification of the 
w160 hAGT are ak144 (N, w160 hAGT): 

5'-GCATGCTACGTAATGGACAAGGATTGTGAAATG-3' (SEQ ID No. 10), ak145 
25 (C, w160 hAGT): 5 , -GAGCACTTAATTAAGTTTCGGCCAGCAGGCGG-3 , (SEQ ID No. 11). 
CHO cells deficient in AGT are transfected with a vector encoding 5-HT 3 -{ w160 hAGT) toop4 - 
receptor. After 24 h of transient expression, cells grown on 0.18 mm thick glass slides are 
transferred to a perfusion chamber and incubated with BGFL (5 jxM) for 5 min. Cells are 
washed three times with PBS buffer to remove excess substrate. For the fluorescence 
30 measurements a Zeiss LSM510 laser scanning confocal microscope is used (Carl Zeiss AG). 
Detection of fluorescein signals (excitation at 488 nm) is achieved by appropriate filters. 
Scanning speed and laser intensity are adjusted to avoid photobleaching of the fluorescent 
probes, and damage or morphological changes of the cells. 

35 Example 11: Human estrogen receptor hER (C) hAGT fusion protein 

The vector pC1-hER containing the human estrogen receptor was provided by the group of 
H. Vogel (EPFL, Lausanne, Switzerland). w160 hAGT is fused to the C-terminus of the 
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receptor between the restriction sites Nhe I and Xho /. The primers for the amplification of 
the w160 hAGT are ak136 (N, Wi60 hAGT): 

5 , -ATCGAGCTAGCGCTACCGGTCGCCACCATGGACAAGGATTGTGAAATG-3 , (SEQ ID 
No. 12) and ak151 (C, w160 hAGT): S-CGTAGCTCGAGAGTUCGGCCAGCAGGC-S 1 (SEQ 
5 ID No. 13). CHO cells deficient in AGT are transfected with a vector encoding Wi60 hAGT- 
hER. After 24 h of transient expression, cells grown on 0.1 8 mm thick glass slides are 
transferred to a perfusion chamber and incubated with BGFL (5 \M) for 5 min. Cells are 
washed three times with PBS buffer to remove excess substrate. For the fluorescence 
measurements a Zeiss LSM510 laser scanning confocal microscope is used (Carl Zeiss 
10 AG). Detection of fluorescein signals (excitation at 488 nm) is achieved by appropriate 
filters. Scanning speed and laser intensity are adjusted to avoid photobleaching of the 
fluorescent probes, and damage or morphological changes of the cells. 
The labeling of the fusion protein w160 hAGT-hER located in the nucleus is verified. The 
nucleus is clearly distinguishable from the rest of the cell, 

15 

Example 12: SV40 large T antigen nuclear localization sequence NLS (C) hAGT and NLS / 
ECFP (C) hAGT 

The three copies of the nuclear localization signal (NLS 3 ) of the simian virus 40 large T- 
antigen are either fused at the C-terminus of the fluorescent protein ECFP fused to a HA-tag 

20 fused to the C-terminus of w160 hAGT yielding w160 hAGT-HA-ECFP-NLS 3 , or are fused directly 
to the C-terminus of w160 hAGT yielding w160 hAGT-NLS 3 . The fusion by PCR is performed 
with the same primers as for the fusion protein Tup 1- w160 hAGT-ECFP (Example 5). Then 
w160 hAGT-HA-ECFP-NLS 3 or w160 hAGT-NLS 3 , respectively, are incorporated into the 
mammalian expression vector pNuc (Clontech) between the restriction sites Nhe I and fig/ //. 

25 The primers are ak136 (N, w160 hAGT) (SEQ ID No. 12), ak137 (C, ECFP): 

S-CATGCAGATCTGAGTCCGGACTTGTACAGCTC-S' (SEQ ID No. 14) and ak107 
(C, w160 hAGT): S'-CCAGGCAGATCTGTTTCGGCCAGCAGGCGGGG-S' (SEQ ID No. 15). 
CHO cells deficient in AGT are transfected with the vector pNuc encoding w160 hAGT-HA- 
ECFP-NLS 3 or alternatively w160 hAGT-NLS 3 . After 24 h of transient expression, cells grown 

30 on 0.18 mm thick glass slides are transferred to a perfusion chamber and incubated with 
BGFL (5 nM) for 5 min. Cells are washed three times with PBS buffer to remove excess 
substrate. For the fluorescence measurements a Zeiss LSM510 laser scanning confocal 
microscope is used (Carl Zeiss AG). Detection of fluorescein or ECFP signals (excitation at 
488 nm) is achieved by appropriate filters. Scanning speed and laser intensity are adjusted 

35 to avoid photobleaching of the fluorescent probes, and damage or morphological changes of 
the cells. 



WO 2004/031404 



PCT/EP2003/010859 



-24- 

Claims 

1 . A labelled AGT fusion protein comprising a protein of interest selected from the group 
consisting of enzymes, DNA-binding proteins, transcription regulating proteins, membrane 

5 proteins, nuclear receptor proteins, nuclear localization signal proteins, protein cofactors, 
small monomeric GTPases, ATP-binding cassette proteins, intracellular structural proteins, 
proteins with sequences responsible for targeting proteins to particular cellular 
compartments, proteins generally used as labels or affinity tags, and domains or subdomains 
of the aforementioned proteins, with the proviso that the major head protein D of phage X 
10 (gpD), and the proteins MHHHHHHSSA, DHFR-HA, V5-NLS-B42, HA-Ura3 and SSN6 are 
excluded. 

2. The labelled AGT fusion protein according to claim 1 wherein the protein of interest is a 
membrane protein. 

15 

3. The labelled AGT fusion protein according to claim 1 wherein the protein of interest is a 
kinase. 

4. The labelled AGT fusion protein according to claim 1 wherein the protein of interest is a 
20 nuclear receptor protein. 

5. The labelled AGT fusion protein according to claim 1 wherein the protein of interest is a 
phosphatase. 

25 6. The labelled AGT fusion protein according to claim 1 wherein the protein of interest is a 
protease. 

7. The labelled AGT fusion protein according to claim 1 which consists of one or more 
proteins of interest fused to AGT at the C- or N- and C-terminal of AGT and a substrate 

30 carrying a label. 

8. The labelled AGT fusion protein according to claim 1 wherein AGT is a variant of human 
AGT with one or more amino acid substitution, deletion or addition. 

35 9. The labelled AGT fusion protein according to claim 8 wherein AGT is a variant wherein 
Asn 157 is replaced by Gly and Ser 159 by Glu, and the one wherein Gly 160 is replaced by Ala or 
Trp. 
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10. The labelled AGT fusion protein according to claim 8 wherein AGT is a variant wherein 
Asn 157 is replaced by Ser f Ser 159 by His, and Gly 160 by Asn. 

5 11. The labelled AGT fusion protein according to claim 1 wherein the label is a 

spectroscopic probe; a radioactively labelled molecule; a molecule which is one part of a 
specific binding pair which is capable of specifically binding to a partner; a molecule that is 
suspected to interact with other biomolecules; a library of molecules that are suspected to 
interact with other biomolecules; a molecule which is capable of crosslinking to other 

10 molecules; a molecule which is capable of generating hydroxyl radicals upon exposure to 
H 2 0 2 and ascorbate; a molecule which is capable of generating reactive radicals upon 
irradiation with light; a molecule covalently attached to a solid support; a nucleic acid or a 
derivative thereof capable of undergoing base-pairing with its complementary strand; a lipid 
or other hydrophobic molecule with membrane-inserting properties; a biomolecule with 

15 desirable enzymatic, chemical or physical properties; or a molecule possessing a 
combination of any of the properties listed above. 

12. The labelled AGT fusion protein according to claim 1 1 wherein the label is a fluorophore, 
a chromophore, a magnetic probe or a contrast reagent. 

20 

13. The labelled AGT fusion protein according to claim 12 wherein the label is a fluorophore. 

14. The labelled AGT fusion protein according to claim 1 1 wherein the label is a molecule 
which is one part of a specific binding pair which is capable of specifically binding to a 

25 partner. 

15. The labelled AGT fusion protein according to claim 1 1 wherein the label is a molecule 
which is capable of crosslinking to other molecules. 

30 16. The labelled AGT fusion protein according to claim 1 1 wherein the label is a molecule 
attached to a solid support. 

17. The labelled AGT fusion protein according to claim 16 wherein the solid support is a 
chemically modified oxidic surface, glass surface, polymer surface, functionalised polymer, 
35 noble metal surface. 
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18. The labelled AGT fusion protein according to claim 17 wherein the solid support is in the 
form of a bead, microtiter plate or sensor element. 

19. The labelled AGT fusion protein according to claim 1 1 wherein the label is a nucleic acid 
5 or a derivative thereof capable of undergoing base-pairing with its complementary strand. 

20. The labelled AGT fusion protein according to claim 1 comprising a plurality of labels. 

21. An AGT fusion protein comprising a protein of interest selected from the group 

10 consisting of enzymes, DNA-binding proteins, transcription regulating proteins, membrane 
proteins, nuclear receptor proteins, nuclear localization signal proteins, protein cofactors, 
small monomeric GTPases, ATP-binding cassette proteins, intracellular structural proteins, 
proteins with sequences responsible for targeting proteins to particular cellular 
compartments, proteins generally used as labels or affinity tags, and domains or subdomains 

15 of the aforementioned proteins, with the proviso that the major head protein D of phage X 
(gpD), and the proteins MHHHHHHSSA, DHFR-HA, V5-NLS-B42, HA-Ura3 and SSN6 are 
excluded. 

22. The AGT fusion protein according to claim 21 wherein the protein of interest is a 
20 membrane protein. 

23. The AGT fusion protein according to claim 21 wherein the protein of interest is a kinase. 

24. The AGT fusion protein according to claim 21 wherein the protein of interest is a nuclear 
25 receptor protein. 

25. The AGT fusion protein according to claim 21 wherein the protein of interest is a 
phosphatase. 

30 26. The AGT fusion protein according to claim 21 wherein the protein of interest is a 
protease. 

27. The AGT fusion protein according to claim 21 which consists of one or more proteins of 
interest fused to AGT at the N-, C- or N- and C-terminal of AGT and a substrate carrying a 
35 label. 
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28. The AGT fusion protein according to claim 21 wherein AGT is a variant of human AGT 
with one or more amino acid substitution, deletion or addition. 

29. The AGT fusion protein according to claim 28 wherein AGT is a variant wherein Asn 157 is 
5 replaced by Gly and Ser 159 by Glu, and the one wherein Gly 160 is replaced by Ala or Trp. 

30. The AGT fusion protein according to claim 28 wherein AGT is a variant wherein Asn 157 is 
replaced by Ser, Ser 159 by His, and Gly 160 by Asn. 

10 31 . A variant of human AGT wherein Asn 157 is replaced by Gly and Ser 159 by Glu, or wherein 
Gly 160 is replaced by Ala or Trp, or wherein Asn 157 is replaced by Ser, Ser 159 by His, and 
Gly 160 by Asn. 

32. A method for detecting and manipulating a protein of interest, characterized in that the 
15 protein of interest incorporated into an AGT fusion protein is contacted with a suitable AGT 

substrate carrying a label, and the AGT fusion protein is detected and optionally further 
manipulated using the label in a system designed for recognising or handling the label. 

33. The method according to claim 32 further comprising the step of forming an AGT fusion 
20 protein from the protein of interest and AGT. 

34. The method according to claim 32 wherein the protein of interest is selected from the 
group consisting of enzymes, DNA-binding proteins, transcription regulating proteins, 
membrane proteins, nuclear receptor proteins, nuclear localization signal proteins, protein 

25 cofactors, small monomelic GTPases, ATP-binding cassette proteins, intracellular structural 
proteins, proteins with sequences responsible for targeting proteins to particular cellular 
compartments, proteins generally used as labels or affinity tags, and domains or subdomains 
of the aforementioned proteins, with the proviso that the major head protein D of phage X 
(gpD), and the proteins MHHHHHHSSA, DHFR-HA, V5-NLS-B42, HA-Ura3 and SSN6 are 

30 excluded. 

35. The method according to claim 34 wherein the protein of interest is a membrane protein. 

36. The method according to claim 34 wherein the protein of interest is a kinase. 

35 

37. The method according to claim 34 wherein the protein of interest is a nuclear receptor 
protein. 
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38. The method according to claim 34 wherein the protein of interest is a phosphatase. 

39. The method according to claim 34 wherein the protein of interest is a protease. 

5 

40. The method according to claim 32 wherein the AGT fusion protein consists of one or 
more proteins of interest fused to AGT at the N-, C- or N- and C-terminal of AGT. 

41 . The method according to claim 32 wherein AGT in the AGT fusion protein is human AGT 
10 or a variant of human AGT with one or more amino acid substitution, deletion or addition. 
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<213> Artificial Sequence 
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<223> synthetic primer 
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gcatgaattc atgactgcca gcgtttcg 

<210> 2 
<211> 36 
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<223> synthetic primer 
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ggatccccat ggtcatttgg cgctattttt ttatac 
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<223> synthetic primer 
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<210> 4 
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<223> synthetic primer 

<400> 4 
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<220> 

<223> synthetic primer 
<400> 6 

cgatgttcca gattaegctg ggtccggcag cgggatggtg ageaagggeg ag 52 
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Page 4 



WO 2004/031404 



PCT/EP2003/010859 



SQL P303A.ST25 

<220> 

<223> synthetic primer 
<400> 14 

catgcagatc tgagtccgga cttgtacagc tc 

<210> 15 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic primer 

<400> 15 

ccaggcagat ctgtttcggc cagcaggcgg gg 



Page 5 



INTERNATIONAL SEARCH REPORT 



Internationa} Application No 

PCT/EP 03/10859 



A. CLASSIFICATION OF .SUBJECT MATTER 

IPC 7 C12Q1/48 G01N33/535 C12N9/10 



According lo International Paieni Classification (IPC) or lo both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12Q G01N C12N 



Documenlalion searched other I nan minimum documentation to the extent (hat such documents are Included In I he fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal , MEDLINE, EMBASE, BIOSIS, WPI Data 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with Indication, where appropriate, of the relevant passages 



Relevant lo claim No. 



AY I TECK CHOON ET AL: "Intracellular 
localization of human DNA repair enzyme 
methyl guan1ne-DNA methyl transferase by 
antibodies and Its Importance." 
CANCER RESEARCH, 

vol. 52, no. 23, 1992, pages 6423-6430, 

XP001146124 

ISSN: 0008-5472 

abstract 

page 6423, column 2, 
6424, column 2, line 
page 6427, column 2, 
page 6428, column 1, paragraph 3 -page 
6429, column 1, paragraph 1 



1-15, 
21-31 



16-20 



paragraph 3 -page 
1 

paragraph 2; figure 6 



-/-- 



m 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



0 Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
'E' earlier document but published on or after the International 

filing date 

V document which may throw doubts on priority claim(s) or 
which Is cited lo establish the publication date of another 
citation or other special reason (as specified) 

'O* document referring to an oral disclosure, use, exhibition or 
other means 

"P* document published prior to the international filing date but 
later than the priority date claimed 



a T* later document published after the International filing dale 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

'X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

'Y* document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&' document member of the same patent family 



Dale of the actual completion of the international search 



11 March 2004 



Date of mailing of the international search report 



26/03/2004 



Name and mailing address of the ISA 

European Patent Office, P.B. 581 6 Patent laan 2 
NL-2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 eponl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Bradbrook, D 



Form PCT71SA/210 (second sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/EP 03/10859 



C(Contlnuatlon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • Citation of document, with Indication, where appropriate, of the relevant passages 



Relevant to claim No. 



ISHIBASHI TORI) ET AL: "Artificial control 

of nuclear translocation of DNA repair 

methyl transferase." 

JOURNAL OF BIOLOGICAL CHEMISTRY, 

vol. 269, no. 10, 1994, pages 7645-7650, 

XP001146125 

ISSN: 0021-9258 

abstract 

page 7646, column 2, paragraph 5 -page 
7648, column 1, paragraph 1; figures 1-4 
page 7648, column 2, paragraph 4 -page 
7650, column 1, paragraph 1 

HANSEN WARREN K ET AL: "Creation of a 
fully functional human chimeric DNA repair 
protein. Combining 06-methyl guanine DNA 
methyl transferase (MGMT) and AP 
endonuclease (APE/redox effector factor 1 
(Ref 1)) DNA repair proteins." 
JOURNAL OF BIOLOGICAL CHEMISTRY, 
vol. 273, no. 2, 

9 January 1998 (1998-01-09), pages 
756-762, XP002232901 
ISSN: 0021-9258 
abstract 

page 757, column 1, paragraph 3 
page 757, column 2, paragraph 6 
column 2, paragraph 3 
page 761, column 2, paragraph 2 -page 762, 
column 1, paragraph 2 



-page 759, 



MORGAN SUSAN E ET AL: "The role of the 
carboxyl -terminal tall in human 
0-6-methyl guanine DNA methyl transferase 
substrate specificity and temperature 
sensitivity. " 

JOURNAL OF BIOLOGICAL CHEMISTRY, 

vol. 268, no. 26, 1993, pages 19802-19809, 

XP002232902 

ISSN: 0021-9258 

abstract 

page 19803, column 1, paragraph 2 -column 
2, paragraph 1 

page 19804, column 1, paragraph 3 -page 
19808, column 1, paragraph 1 
page 19808, column 2, paragraph 2 

V- 



1-15, 
21-41 



16-20 



21-31 



21-31 



Form PCT71SA/210 (continuation ol second sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/EP 03/10859 



C .(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • Citation of document, with indication, where appropriate, of the relevant passages 



CHOI UIM00K ET AL: "Green Fluorescent 
Protein-methyl guanine methyl transferase 
fusion protein demonstrates nuclear 
localization and confers drug resistance: 
An Informative model for the development 
of drug selectable gene therapy." 
BLOOD, 

vol. 98, no. 11 Part 2, 

16 November 2001 (2001-11-16), page 403b 

XP009006490 

43rd Annual Meeting of the American 
Society of Hematology, Part 2;0rlando, 
Florida, USA; December 07-11, 2001, 
November 16, 2001 
ISSN: 0006-4971 
abstract 

LIEM LIE-KUO ET AL: "Factors Influencing 

the repair of the mutagenic lesion 

0-6-methyl guanine 1n DNA by human 

0-6-methyl guanl ne-DNA methyl tr ans f erase . " 

JOURNAL OF MOLECULAR BIOLOGY, 

vol. 231, no. 4, 1993, pages 950-959, 

XP002066517 

ISSN: 0022-2836 

abstract 

page 953, column 2, paragraph 2 -page 958, 
column 2, paragraph 2 

SRIVENUGOPAL K S ET AL: "PHOSPHORYLATION 

OF 06-ALK YLGUAN I NE-DNA ALKYLTRANSFERASE: 

EXPERIENCE WITH A GST-FUSION PROTEIN AND A 

NEW PULL-DOWN ASSAY" 

CANCER LETTERS, NEW YORK, NY, US, 

vol. 181, no. 1, 8 July 2002 (2002-07-08), 

pages 87-93, XP001 145706 

ISSN: 0304-3835 

page 88, column 2, paragraph 1 -page 89, 
column 1, paragraph 1 
column 2, paragraph 2 

DE 199 03 895 A (JOHNSS0N KAI) 
3 August 2000 (2000-08-03) 
figures 1-3 

-/-- 



Relevant to claim No. 



1-15, 
21-31 



16-20 
21-31 



1-15, 
21-31 



16-20 



Form PCT/lSA/210 {ccnimuaflton ol second sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/EP 03/10859 



C(Contfnuatlon) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category ° 


Citation of document, with indication. where appropriate, of the relevant passages 


Relevani lo claim No. 


Y 


DAMOISEAUX R ET AL: "SYNTHESIS AND 

APPLICATIONS OF CHEMICAL PROBES FOR HUMAN 

06-ALKYLGUANINE-DNA ALKYLTRANSFERASE" 

CHEMBIOCHEM - A EUROPEAN JOURNAL OF 

CHEMICAL BIOLOGY, WILEY VCH, WEINHEIM, DE, 

vol. 2, no. 4, 2 April 2001 (2001-04-02), 

pages 285-287, XP001148325 

ISSN: 1439-4227 

cited 1n the application 

page 285, column 1, paragraph 1 -column 2, 

paragraph 2 


16-20 


A 


XU-WELLIVER MENG ET AL: "Role of codon 
160 1n the sensitivity of human 
06-alkylguanlne-DNA alkyl transferase to 
06-benzyl guanine" 
BIOCHEMICAL PHARMACOLOGY, 
vol. 58, no. 8, 

15 October 1999 (1999-10-15), pages 
1279-1285, XP002273209 
ISSN: 0006-2952 
abstract 


8-10, 1 
28-31 


A 


XU-WELLIVER MENG ET AL: "Point mutations 
at multiple sites Including highly 
conserved amino adds maintain activity, 
but render 06-alkylguan1ne-DNA 
alkyltransferase Insensitive to 
06-benzyl guanine" 
BIOCHEMICAL JOURNAL, 
vol. 347, no. 2, 

15 April 2000 (2000-04-15), pages 519-526, 
XP002273210 
ISSN: 0264-6021 
abstract; tables 2,3,5 


8-10, 
28-31 


P,X 


W0 02 083937 A (KEPPLER ANTJE ;EC0LE 
P0LYTECH (CH); J0HNSS0N KAI (CH); 
GENDREIZIG) 24 October 2002 (2002-10-24) 
cited 1n the application 
the whole document 


1-31 


P,X 


KEPPLER ANTJE ET AL: "A general method 
for the covalent labeling of fusion 
proteins with small molecules 1n vivo." 
NATURE BIOTECHNOLOGY, 
vol. 21, no. 1, 

20 January 2003 (2003-01-20), pages 86-89, 
XP002231034 
ISSN: 1087-0156 
the whole document 


1-31 



Forni PCT/ISA/210 (continual ion of second sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



International Application No 

PCT/EP 03/10859 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



DE 19903895 



03-08-2000 DE 



19903895 Al 



03-08-2000 



W0 02083937 



24-10-2002 



CA 
W0 



2443570 Al 
02083937 A2 



24-10-2002 
24-10-2002 



Form PCT/1SA/210 (patent family annex) (July 1092) 



